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Abstract 

We consider the dynamic behavior of several variants of the Network Creation Game, 
introduced by Fabrikant et al. [PODC'03]. For this, we treat these games as sequential- 
move games and analyze whether selfish play eventually leads to an equilibrium state of 
the game. We show that fast convergence is guaranteed for all versions of Swap Games, 
if the initial network is a tree. Furthermore, we show that this process can be sped 
up to an almost optimal number of moves by employing a very natural move policy. 
Unfortunately, these positive results are no longer true if the initial network has cycles 
and we show that even one non-tree edge suffices to destroy the convergence-guarantee. 
Moreover, we show that on non-tree networks no move policy can enforce convergence. 
We extend our negative results to the well-studied original version, where agents are 
allowed to buy and delete edges. That is, we show that in the original model there is no 
convergence guarantee - even if all agents play optimally. 



1 Introduction 

The Internet is arguably one of the most important artificially created networks of our time. 
Understanding Internet-like networks and their implications on our life is a recent endeavor 
undertaken by researchers from different research communities and seems a very important 
and challenging task. Such networks are difficult to analyze since they are created by a 
multitude of selfish entities (e.g. Internet Service Providers) which modify the infrastructure 
of parts of the network (e.g. their Autonomous Systems) to improve their service quality. The 
classical field of Game Theory provides the tools for analyzing such decentralized processes 
and from this perspective the Internet can be seen as an equilibrium state of an underlying 
game played by selfish agents. 

Within the last decade several such games have been proposed and analyzed. We will 
focus on the line of works which consider Network Creation Games, as introduced by Fab- 



rikant et al. 11 . These games are very simple but they contain an interesting trade-off 
between an agent's investment in infrastructure and her obtained usage quality. Agents aim 
to invest as little as possible but at the same time they want to achieve a good connection 
to all other agents in the network. Network Creation Games and several variants have been 
studied intensively, but, to the best of our knowledge, almost all these works exclusively fo- 
cus on properties of the equilibrium states of the game. With this focus, the game is usually 
considered to be a one-shot simultaneous-move game. However, the Internet was not created 
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in "one shot". It has rather evolved from an initial network, the ARPANET, into its current 
shape by repeated infrastructural changes performed by selfish agents who entered or left the 
stage at some time in the process. For this reason, we focus on a more dynamic point of view: 
We analyze the properties of the network creation processes induced by the sequential-move 
version of the known models of selfish network creation. 

1.1 Models and Definitions 

We consider several versions of a network creation process performed by n selfish agents. In all 
versions we consider networks, where every node corresponds to an agent and undirected links 
connect network nodes. The creation process is based on an underlying Network Creation 
Game (NCG) and can be understood as a dynamic process where agents sequentially perform 
strategy-changes in the NCG. In such games, the strategies of the agents determine which 
links are present in the network and any strategy-profile, which is a vector of the strategies 
of all n agents, determines the induced network. But this also works the other way round: 
Given some network G = (V, E, o), where V is the set of n vertices, E is the set of edges and 
o : E — > V is the ownership-function, which assigns the ownership of an edge to one of its 
endpoints, then G completely determines the current strategies of all n agents of the NCG. 
Starting from a network Go, any sequence of strategy-changes by agents can thus be seen as 
a sequence of networks Go, Gi, G2, ■ ■ ■ , where the network Gj+i arises from the network Gi 
by the strategy-change of exactly one agent. In the following, we will write xy or yx for the 
undirected edge {x,y} £ E. In figures we will indicate edge-ownership by directing edges 
away from their owner. 

The creation process starts in an initial state Go, which we call the initial network. A 
step from state Gj to state Gj+i consists of a move by one agent. A move of agent u in state 
Gi is the replacement of agent u's pure strategy in Gj by another admissible pure strategy 
of agent u. The induced network after this strategy-change by agent u then corresponds to 
the state Gj+i. We consider only improving moves, that is, strategy-changes which strictly 
decrease the moving agent's cost. The cost of an agent in Gi depends on the structure of Gi 
and it will be defined formally below. If agent u in state Gj has an admissible new strategy 
which yields a strict cost decrease for her, then we call agent u unhappy in network Gi and 
we let Ui denote the set of all unhappy agents in state Gj. Only one agent can actually move 
in a state of the process and this agent u S Ui, whose move transforms Gj into Gj+i, is called 
the moving agent in state Gi. In any state of the process the identity of the moving agent is 
determined by the move policy of the process, which we will define below. The process stops 
in some state Gj if no agent wants to perform a move, that is, if Uj = 0, and we call the 
resulting networks stable. Clearly, stable networks correspond to pure Nash equilibria of the 
underlying NCG. 

Depending on what strategies are admissible for an agent in the current state, there are 
several variants of this process, which we call game types: 

• In the Swap Game (SG), introduced as "Basic Network Creation Game" by Alon et 
al. [2] , the strategy S u of an agent u in the network Gi is the set of neighbors of vertex 
u in Gj. The new strategy S* is admissible for agent u in state Gj, if \S U \ = \S*\ 
and \S U n 5*| = \S U \ — 1. Intuitively, admissible strategies in the SG are strategies 
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which replace one neighbor x of u by another vertex y. Note, that this corresponds to 
"swapping" the edge ux from x towards y, which is the replacement of edge ux by edge 
uy. Furthermore, observe, that in any state both endpoints of an edge are allowed to 
swap this edge. Technically, this means that the ownership of an edge has no influence 
on the agents' strategies. 



• The Asymmetric Swap Game (ASG), recently introduced by Mihalak and Schlegel 14 , 
is similar to the SG, but here the ownership of an edge plays a crucial role. Only the 
owner of an edge is allowed to swap the edge in any state of the process. The strategy 
S u of agent u in state Gi is the set of neighbors in G, to which u owns an edge and the 
strategy S* is admissible for agent u in state Gi, if \S U \ = \S*\ and \S U D = \S U \ — 1. 
Hence, in the ASG the moving agents are allowed to swap one own edge. 

• In the Greedy Buy Game (GBG), recently introduced by Lenzner |13| , agents have more 
freedom to act. In any state, an agent is allowed to buy or to delete or to swap one 
own edge. Hence, the GBG can be seen as an extension of the ASG. The strategy S u 
of agent u in state Gi is defined as in the ASG, but the set of admissible strategies is 
larger: S* is admissible for agent u in state Gi if (1) |5*| = \S U \ + 1 and S u C S* or (2) 
if |5*| = \S U \ - 1 and 5* C S u or (3) if \S U \ = \S*\ and \S U n S*\ = \S U \ - 1. 

• The Buy Game (BG), which is the original version of an NCG and which was introduced 
by Fabrikant et al. [11] , is the most general version. Here agents can perform arbitrary 
strategy-changes, that is, agents are allowed to perform any combination of buying, 
deleting and swapping of own edges. The strategy S u of agent u in Gi is defined as in 
the ASG, but an admissible strategy for agent u is any set S* C V \ {u}. 

The cost of an agent u in network Gi has the form cq^u) = eo^u) + Sq^u), where ec t (u) 
denotes the edge-cost and 5G i (u) denotes the distance-cost of agent u in the network G{. If 
edges have owners, then we assume that each edge has cost a > 0, which is a fixed constant, 
and this cost has to be paid fully by the owner. Hence, if agent it owns k edges in state 
Gi, then eG i { u ) = I n the SG, where edges do not have owners, we simply omit the 
edge-cost term in the cost function. There are two variants of distance-cost functions. In 
the SuM-version, we have o~Gi(u) = YlveVfGi) ^diu, v), if the network Gi is connected and 
6Gi(u) = oo, otherwise. In the MAX-version, we have 5a i ( u ) = max i>eV(G;) ddiu, v), if Gi 
is connected and Sq^u) = oo, otherwise. In both cases dGf{u,v) denotes the shortest path 
distance between vertex u and v in the undirected graph Gi. 

The move policy specifies for any state of the process, which of the unhappy agents is 
allowed to perform a move. From a mechanism design perspective, the move policy is a way 
to enforce coordination and to guide the process towards a stable state. We will focus on the 
max cost policy, where the agent having the highest cost is allowed to move and ties among 
such agents are broken arbitrarily. Sometimes we will assume that an adversary chooses the 
worst possible moving agent. Note, that the move policy only specifies who is allowed to 
move, not which specific move has to be performed. We do not consider such strong policies 
since we do not want to restrict the agents' freedom to act. 

Any combination of the four game types, the two distance functions and some move 
policy together with an initial network completely specifies a network creation process. We 
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will abbreviate the names, e.g. by calling the Buy Game with the Sum- version of the distance 
cost the Sum-BG. 

A cyclic sequence of networks C%, . . . ,Cj, where network Cj+i mo dj arises from network 

mod 

j by an improving move of one agent is called a better response cycle. If every move 
in such a cycle is a best response move, which is a strategy-change towards an admissible 
strategy which yields the largest cost decrease for the moving agent, then we call such a 
cycle a best response cycle. Clearly, a best response cycle is a better response cycle, but the 
existence of a better response cycle does not imply the existence of a best response cycle. 

1.2 Classifying Games According to their Dynamics 

Analyzing the convergence processes of games is a very rich and diverse research area. We 
will briefly introduce two well-known classes of finite strategic games: games having the finite 
improvement property (FIPG) [16] and weakly acyclic games (WAG) [17] . 

FIPG have the most desirable form of dynamic behavior: Starting from any initial state, 
every sequence of improving moves must eventually converge to an equilibrium state of the 
game, that is, such a sequence must have finite length. It was shown in [16] that a finite 
game is a FIPG if and only if there exists a generalized ordinal potential function which 
maps strategy-profiles to real numbers and has the property that if the moving agent's cost 
decreases, then the potential function value decreases as well. Stated in our terminology, this 
means that $> : Q n — > R, where Q n is the set of all networks on n nodes, and we have 

c Gl (u) ~ c Gl+1 {u) > =► *(Gi) - *(G i+ i) > 0, 

if agent u is the moving agent in the network Grj. Clearly, no FIPG can admit a better 
response cycle. 

An especially nice subclass of FIPG are games that are guaranteed to converge to a stable 
state in a number of steps which is a polynom of the size of the game. We call this subclass 
poly-FIPG. 

Weakly acyclic games are a super-class of FIPG. Here it is not necessarily true that any 
sequence of improving moves must converge to an equilibrium but we have that from any 
initial state there exists some sequence of improving moves which enforces convergence. A 
subclass of WAG are games where from any initial state there exists a sequence of best 
response moves, which leads to an equilibrium. We call those games weakly acyclic under 
best response, BR- WAG for short. 

The above mentioned classes of finite strategic games are related as follows: 

poly-FIPG C FIPG C BR- WAG C WAG. 

The story does not end here. Very recently, Apt and Simon [3j have classified WAG in much 
more detail by introducing a "scheduler" , which is a moderating super-player who guides the 
agents towards an equilibrium. Moreover, Fabrikant et al. [To] showed that the existence of 
a pure Nash equilibrium in every subgame implies weak acyclicity. 

1.3 Related Work 

The original model of Network Creation Games, which we call the Sum-BG, was introduced 
a decade ago by Fabrikant et al. |llj. Their motivation was to understand the creation of 
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Internet-like networks by selfish agents without central coordination. In the following years, 
several variants were proposed: The Max-BG [8], the StJM-SG and the Max-SG |2|, the 



SUM-ASG and the Max-ASG [14], the Sum-GBG and the Max-GBG 13 , a bounded budget 



version [9] and an edge-restricted version (4j[7j. All these works, as well as several others, 
e.g. [I] and [l5j, focus exclusively on properties of stable networks or on the complexity of 
computing an agent's best response. To the best of our knowledge, the dynamic behavior of 
most of these variants, including best response dynamics in the well-studied original model, 
has not yet been analyzed. 

In earlier work [12] we analyzed the game dynamics of the Sum-SG and showed that if the 
initial network Go is a tree on n nodes, then the network creation process is guaranteed to 
converge m 0{n 3 ) steps. It follows that the Sum-SG on trees is a poly-FIPG. It was shown 
that by employing the max cost policy, this process can be sped up significantly to 0{n) 
steps, which is asymptotically optimal. For the Sum-SG on general graphs we showed that 
there exist initial networks for which there exists a cyclic sequence of best response moves. 
This implies that the Sum-SG on arbitrary initial networks is not a FIPG. 

Very recently, Cord-Landwehr et al. [6] studied a variant of the Max-SG, where agents 
have communication interests, and showed that this variant admits a best response cycle on 
a tree network as initial network. Hence the restricted-interest variant of the Max-SG is not 
a FIPG - even on trees. 

Brandes et al. [5] were the first to observe that the Sum-BG is not a FIPG and they 
prove this by providing a better response cycle. Very recently, Bilb et al. [4j gave a better 
response cycle for the Max-BG which implies the same statement for this version. Note, 
that both proofs contain agents who perform a sub-optimal move at some step in the better 
response cycle. Hence, these two results do not address the convergence behavior if agents 
play optimally. 



1.4 Our Contribution 

In this work, we study Network Creation Games, as proposed by Fabrikant et al. Ill], and 
several natural variants of this model from a new perspective. Instead of analyzing properties 
of equilibrium states, we apply a more constructive point of view by asking if and how fast 
such desirable states can be found by selfish agents. For this, we turn the original model and 
its variants, which are originally formulated as one-shot simultaneous-move games, into more 
algorithmic models, where moves are performed sequentially. 

For the Max Swap Game on trees, we show that the process must converge in C(n 3 ) steps, 
where n is the number of agents. Furthermore, by introducing a natural way of coordination 
we obtain a significant speed-up to 0(nlogn) steps, which is almost optimal. We show that 
these results, combined with results from our earlier work [12], give the same bounds for the 
Asymmetric Swap Game on trees in both the Sum- and the MAX-version. 

These positive results for initial networks which are trees are contrasted by several strong 
negative results on general networks. We show that the Max-SG, the Sum-ASG and the 
Max-ASG on general networks are not guaranteed to converge if agents repeatedly perform 
best possible improving moves and, even worse, that no move policy can enforce convergence. 
We show that these games are not in FIPG, which implies that there cannot exist a generalized 
ordinal potential function which "guides" the way towards an equilibrium state. For the Sum- 
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ASG we show the even stronger negative result that it can happen that no sequence of best 
response moves may enforce convergence, that is, the Sum-ASG is not even weakly acyclic 
under best response. If not all possible edges can be created, that is if we have a non- 
complete host graph (4j[7j, then we show that the Sum-ASG and the Max-ASG on non-tree 
networks is not weakly acyclic. Moreover, we map the boundary between convergence and 
non-convergence in ASGs and show the surprising result that cyclic behavior can already 
occur in n-vertex networks which have n edges. That is, even one non- tree edge suffices to 
completely change the dynamic behavior of these games. In our constructions we have that 
every agent owns exactly one edge, which is equivalent to the uniform- budget case introduced 
by Ehsani et al. [9j. In their paper [9j the authors raise the open problem of determining 
the convergence speed for the bounded-budget version. Thus, our results answer this open 
problem - even for the simplest version of these games - in the negative, since we show that 
no convergence guarantee exists. 

Finally, we provide best response cycles for all versions of the Buy Game, which implies 
that these games have no convergence guarantee - even if agents have the computational 
resources to repeatedly compute best response strategies. To the best of our knowledge, the 
existence of best response cycles for all these versions was not known before. This addresses 
a proposed extension of the original model [IT] , where the networks evolve in stages with new 
players arriving in every stage and all stages should be equilibria. We show that even within 
such a stage the desired stabilization, apart from computational issues, may not happen. 
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Table 1: Summary of results for the SuM-version. BB abbreviates "bounded-budget" [9] 
BRC abbreviates "best response cycle" . 
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Table 2: Summary of results for the Max- version. 



2 Dynamics in Max Swap Games 

In this section we focus on the game dynamics of the Max-SG. Interestingly, we obtain 
results which are very similar to the results shown in our earlier work [12] but we need 
entirely different techniques to derive them. Omitted proofs can be found in the Appendix. 
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2.1 Dynamics on Trees 

We will analyze the network creation process in the Max-SG when the initial network is a 
tree. We prove that this process has the following desirable property: 

Theorem 1. The Max-SG on trees is guaranteed to converge in C(n 3 ) steps to a stable 
network. That is, the M.AX-SG on trees is a poly-FIPG. 

Before proving Theorem [IJ we analyze the impact of a single edge-swap. 

Let T = (V, E) be a tree on n vertices and let agent v be unhappy in network T. Assume 
that agent v can decrease her cost by performing the edge-swap vu to vw, for some u, w G V. 
This swap transforms T into the new network T 1 = (V, (E \ {vu}) U {vw}). Let oriv) = 
maxj-gy^) oIt(v,x) denote agent v's cost in the network T. Let ct>(u) denote her respective 
cost in T'. Let A denote the tree of T" = (V,E \ {vu}) which contains v and let B be the 
tree of T" which contains u and w. It is easy to see, that we have dx(x,y) = dr'{x,y), if 
x, y G V(A) or if x, y G V(B). 

Lemma 1. For all x G V(A) there is no y G V{A) such that ct(x) = dx{x,y). 

Proof. By assumption, agent v can decrease her cost by swapping the edge vu to edge vw, 
where u,w G V(B). We have that dr(x, v) < ct(v), for all x G V(A), since otherwise this 
swap would not change agent v's cost. It follows that for arbitrary x, y G V(A) we have 

dr(x,y) < d T (x,v) + d T (v,y) < d T (x,v) +c T (v). 

Let z G V{B) be a vertex having maximum distance to v in T, that is, ct(v) = dr(v, z). The 
above implies that dx(x,y) < dx{x,z) = ct(x), for all x,y G V(A). □ 

The above lemma directly implies the following statement: 

Corollary 1. For all x G V(A), we have ct(x) > ct>(x). 

Hence, we have that agent v's improving move decreases the cost for all agents in V(A). For 
agents in V(B) this may not be true: The cost of an agent y G V(B) can increase by agent 
v's move. Interestingly, the next result guarantees that such an increase cannot be arbitrarily 
high. 

Lemma 2. Let x G V(A) and y G V(B) such that dT'(x,y) = CT'(y)- It holds that ct(x) > 
CT'(y)- 

Proof. In tree T we have ct{x) = dx{x,v) + dx(u,z) + 1. Furthermore, in tree T' we have 
c T'{y) = dr'(x,v) + dr'(w,y) + 1. Since ct[v) > ct'{v), it follows that dx(w,y) < dr{u,z), 
where z G V(B) is a vertex having maximum distance to v in T. Hence, we have 

c T (x) - c T '(y) = d T (u, z) - d T (w,y) > 0. 

□ 

Towards a generalized ordinal potential function we will need the following: 
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Definition 1 (Sorted Cost Vector and Center- Vertex) . Let G be any network on n vertices. 
The sorted cost vector of G is cq = (7 G , . . • , Jq), where 7 G is the cost of the agent, who has 
the i-th highest cost in the network G. An agent having cost j G is called center- vertex of G. 

Lemma 3. Let T be any tree on n vertices. The sorted cost vector ofT induces a generalized 
ordinal potential function for the M.AX-SG on T. 

Proof. Let v be any agent in T, who performs an edge-swap which strictly decreases her cost 
and let T 1 denote the network after agent v's swap. We show that 

C T {v) - C T '(v) > => Ct >1cx c t >, 

where >i ex is the lexicographic order on N n . The existence of a generalized ordinal potential 
function then follows by mapping the lexicographic order on N n to an isomorphic order on 
R. 

Let the subtrees A and B be defined as above and let ct{v) — ct>(v) > 0. By Lemma [T] 
and Lemma [2j we know that there is an agent x G V(A) such that ct(x) > CT>(y), for all 
y G V{B). By Lemma [T] and Corollary [TJ we have that ct(x) > ct<(x), which implies that 
CT >lex c T '. □ 

Lemma 4. Let T = (V,E) be a connected tree on n vertices having diameter D > 4. After 
at most nD ^ D moves of the M.AX-SG on T one agent must perform a move which decreases 
the diameter. 

For proving Lemma [4] we need two easy-to-see properties of the sorted cost vector and another 
observation about center- vertices. 



Observation 2. Let G be any connected network on n nodes and let cq = (7g> • • • >7g) ^ e 
its sorted cost vector. We have 7 G = j G and 7^. 



Definition 2 (Longest Path). Let G be any connected network. Let v be any agent in G 
having cost cq{v) = k. Any simple path in G, which starts at v and has length k is called a 
longest path of agent v. 

As we will see, center-vertices and longest paths are closely related. 

Lemma 5. Let T be any connected tree and let v* be a center-vertex of T. Vertex v* must 
lie on all longest paths of all agents in V{T). 

Proof. Let P xy denote the path from vertex x to vertex y in T. We assume towards a 
contradiction that there are two vertices v,w £ V{T), where ct{v) = dx{v,w), and that 
v* $l V(P VW ). Let z £ V(T) be the only shared vertex of the three paths P vv * , P wv * , P vw . 
We have dT(v,z) < dx(v,v*) < ct(v*) and dT(w,z) < dT{w,v*) < ct(v*). We show that 
ct(z) < ct(v*), which is a contradiction to v* being a center- vertex in T. 

Assume that there is a vertex u G V(T) with dx(u, z) > ct(v*). It follows that V(P VZ ) n 
V{Pzu) = {z}, since otherwise dT(v*,u) = dT(v*,z) + dr(z,u) > ct(v*). But now, since 
dx(z,w) < ct{v*) < dx(z,u), we have driven) > ct(v), which clearly is a contradiction. 
Hence, we have dT(z,u) < ct(v*), for all u G V(T), which implies that ct(z) < ct(v*). □ 



Proof of Lemma^ Let v, w G V such that dx(v, w) = D > 4 and let P utu be the path from v 
to w in T. Clearly, if no agent in V(P VW ) makes an improving move, then the diameter of the 
network does not change. On the other hand, if the path P vw is the unique path in T having 
length D, then any improving move of an agent in V{P VW ) must decrease the diameter by at 
least 1. The network creation process starts from a connected tree having diameter D > 4 
and, by Lemma [3j must converge to a stable tree in a finite number of steps. Moreover, 
Lemma [3] guarantees that the diameter of the network cannot increase in any step of the 
process. It was shown by Alon et al. [2] that any stable tree has diameter at most 3. Thus, 
after a finite number of steps the diameter of the network must strictly decrease, that is, on 
all paths of length D some agent must have performed an improving move which reduced the 
length of the respective path. We fix the path P vw to be the path of length D in the network 
which survives longest in this process. 

It follows, that there are \ V\ V{P VW )\ = n — (D + 1) agents which can perform improving 
moves without decreasing the diameter. We know from Observation [2] and Lemma [5] that 
each one of those n — (D + 1) agents can decrease her cost to at most \^\ + 1 and has 
to decrease her cost by at least 1 for each edge-swap. We show that an edge-swap of such 
an agent does not increase the cost of any other agent and use the minimum possible cost 
decrease per step to conclude the desired bound. 

Let u G V(T) \ V(P VW ) be an agent who decreases her cost by swapping the edge ux to 
uy and let T' be the tree after this edge-swap. Let a, b G V(T) be arbitrary agents. Clearly, 
if {u,y} % V{P a b) in T, then d T (a,b) = d T ,(a,b). Let A be the tree of T" = (V, E \ {uy}) 
which contains u and let B be the tree of T" which contains y. W.l.o.g. let a G V(A) and 
b G V(B). By Corollary [TJ we have ct{z) > ct'(z) for all z G V(A) and it follows that 
V(A) n V(P VW ) = 0- Hence, it remains to analyze the change in cost of all agents in V(B). 

If no vertex on the path P a b is a center- vertex in T', then, by Lemma [5| we have that 
d,T'(a,b) < CT'(b). It follows that every longest path of agent b in T' lies entirely in subtree 
B which implies that cy/(6) < cy(6). 

If there is a center- vertex of T' on the path P a b in T', then let v* be the last such vertex 
on this path. We have assumed that the diameters of T' and T are equal, which implies that 
P vw is a longest path of agent v in T' . Since, by Lemma [5| any center- vertex of T' must lie 
on all longest paths, it follows that v* is on the path P vw and we have v* G V(B). W.l.o.g. 
let d T >(v,b) > d T '(w,b). We have d T <{a,b) = d T *(a,v*) + d T *(v*,b) < d T >(v,v*) +d T <{v*,b). 
Hence, we have dy(a, b) < CT>{b). Since the path Pb v is in subtree B, we have CT'(b) < cxib). 

Now we can easily conclude the upper bound on the number of moves which do not 
decrease the diameter of T. Each of the n— (D + 1) agents with cost at most D may decrease 
their cost to \§] +1. If we assume a decrease of 1 per step, then this yields the following 
bound: 



(n-(D + l))[D 



D 



, , , . D nD-D 
+ 1) ) <(n-D)- = — 



2 



□ 

Finally, we can set out to proving Theorem [TJ 

Proof of Theorem^ By Lemma [3j we know there exists a generalized ordinal potential func- 
tion for the Max-SG on trees. Hence, we know that this game is a FIPG and we are left 
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to bound the maximum number of improving moves needed for convergence. It was already 
shown by Alon et al. 12], that the only stable trees of the Max-SG on trees are stars or 
double-stars. Hence, the process must stop at the latest when diameter 2 is reached. 

Let N n (T) denote the maximum number of moves needed for convergence in the Max- 
SG on the n-vertex tree T. Let D(T) be the diameter of T. Let Di )TL denote the maximum 
number of steps needed to decrease the diameter of any n-vertex tree having diameter i by 
at least 1. Hence, we have 

D(T) n-l 
N n (T)< ^%<^A,n, 
i=3 i=3 

since the maximum diameter of a n-vertex tree is n — 1 . By applying Lemma [4] and adding 
the steps which actually decrease the diameter, this yields 

i—l n— 1 / ■ -2 \ / n \ ~\ / n \ 

N n ( T ) < y: A,n < E + 1 <«+? D - E»1 € 

i=3 i=3 ^ ' \i=l / \i=l / 

□ 

Now let us analyze the impact of enforced coordination by a suitable move policy. We have 
the following result, which is close to optimal, since it is easy to see that there are instances 
in which Q(n) steps are necessary: 

Theorem 3. The Max-SG on trees with the max cost policy as move policy converges in 
0(nlogn) moves. 

We prove Theorem [3] by proving the lower and the upper bound separately, starting with 
the former. Since we analyze the max cost policy, we need two additional observations. 

Observation 4. An agent having maximum cost in a tree T must be a leaf ofT. 

Observation 5. Let u be an unhappy agent inT = (V, E) and let u be a leaf ofT and let v be 
u's unique neighbor. Let B be the tree ofT' = (V, E\{uv}) which contains v. The edge-swap 
uv to uw, for some w £ V(B) is a best possible move for agent u if w is a center-vertex of B. 

Lemma 6. There is a tree T onn vertices where the Max-SG on T with the max cost policy 
needs O(nlogn) moves for convergence. 

Proof. We consider the path on n- vertices P„ = V1V2 ■ ■ ■ v n of length n — 1. We apply the max 
cost policy and for breaking ties we will always choose the vertex having the smallest index 
among all vertices having maximum cost. If a maximum cost vertex has more than one best 
response move, then we choose the edge-swap towards the new neighbor having the smaller 
index. With these assumptions and with Observation [4] and Observation [5j we have that the 
center-vertex having the smallest index will "shift" towards a higher index, from to 
v n -2- Finally, agent v n is the unique agent having maximum cost and her move transforms 
the tree to a star. See Fig. [T]for an illustration for n = 9. 

We start by analyzing the change in costs of agent v\. Clearly, co = cp n (vi) = n — 1. 
By Observation [5] we know that v\s best swap connects to the minimum index center- 
vertex of the tree without vertex v\. Hence after the best move of v\ this agent has cost 
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Figure 1: The convergence process for with n = 9 in the Max-SG on P, 



ci = r^V^l + 1 > 2 1 - When i>i is chosen to move again, her cost can possibly decrease to 



+ 1 > ft. Thus, the 



[" Cl 2 1 ] + 1 > 2a. After the i-th move of v\ her cost is at least — I — - 

max cost policy allows agent v\ to move at least log y times until she is connected to vertex 
v n -2, the center of the final star, where she has cost 3. 

The above implies, that the number of moves of every agent allowed by the max cost 
policy only depends on the cost of that agent when she first becomes a maximum cost agent. 
Moreover, since all moving agents are leaves, no move of an agent increases the cost of any 
other agent. By construction, the cost of every moving agent is determined by her distance 
towards vertex v n . Since agent v n does not move until in the last step of the process, we 
have that a move of agent Vi does not change the cost of any other agent Vj ^ v n who moves 
after V{. It follows, that we can simply add up the respective lower bounds on the number of 
moves of all players, depending on the cost when they first become maximum cost agents. It 
is easy to see, that agent V{ becomes a maximum cost agent, when the maximum cost is n — i. 
Let M(P n ) denote the number of moves of the Max-SG on P n with the max cost policy and 
the above tie-breaking rules. This yields 

4 

M{P n )> lQ g? G n(nlogn). 

co=n— 1 

□ 

Lemma 7. The Max-SG on a n-vertex tree T with the max cost policy needs O(nlogn) 
moves to converge to a stable tree. 

Proof. Consider any tree T on n vertices. By Observation |4j we know that only leaf-agents 
are allowed to move by the max cost policy, which implies that no move of any agent increases 
the cost of any other agent. Observation [5] guarantees that the best possible move of a leaf- 
agent u having maximum cost c decreases agent u's cost to at most |~|] + 1. Hence, after 
O(logn) moves of agent u her cost must be at most 3. If the tree converges to a star, then 
agent u may move one more time. If we sum up over all n agents, then we have that after 
O(nlogn) moves the tree must be stable. □ 
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2.2 Dynamics on General Networks 

In this section we show that allowing cycles in the initial network completely changes the 
dynamic behavior of the Max-SG. 

Theorem 6. The M.AX-SG on general networks admits best response cycles. Moreover, no 
move policy can enforce convergence. The first result holds true even if agents are allowed to 
swap more than one edge at a time. 

Proof of Theorem [6| We prove the theorem by showing that there exists an initial network 
which induces a best response cycle and where in every step of the cycle exactly one agent is 
unhappy. The existence of the best response cycle shows that the Max-SG on this instance 
does not have the finite improvement property. The fact that in every step of the cycle exactly 
one agent is unhappy shows that no move policy can avoid that cyclic behavior. In every step, 
swapping one edge suffices to achieve the best possible cost decrease for the moving agent. 
Hence, there exists a best response cycle even if agents are allowed to perform multi-swaps. 
However, note that with multi-swaps it is no longer true that there is only one unhappy agent 
in every step. 






(1) 



(2) 



(3) 



Figure 2: The steps of a best response cycle for the Max-SG on general networks. 

Consider the initial network G\ which is depicted in Fig. [2] (1). Note, that only agents 
01,03,63 and C3 have cost 3 while all other agents have cost 2. Clearly, agents having cost 
2 cannot improve on their current situation. Agents 03,63,03 cannot perform an improving 
move, since all of them have exactly two vertices in distance 3 and there is no vertex which 
is a neighbor of both of them. This leaves agent a\ as the only possible candidate for an 
improving edge-swap. A best possible move for agent a\ is the swap ai&i to a\C\, which yields 
a distance decrease of 1 which is clearly optimal. This swap transforms Gi into G2, which is 
depicted in Fig. [2] (2). Observe, that G2 is isomorphic to G\, with agent b\ facing exactly the 
same situation as agent a\ in G\. Agent h\ has the swap b\C\ to b\a\ as best response move 
and we end up in network G3, shown in Fig. [2] (3). Again, G3 is isomorphic to G±, now with 
agent c\ being the unique unhappy agent. Agent ci's best possible swap transforms G3 back 
into G%. □ 
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3 Dynamics in Asymmetric Swap Games 



In this section we consider the Sum-ASG and the Max-ASG. Note, that now we assume that 
each edge has an owner and only this owner is allowed to swap the edge. We show that we 
can directly transfer the results from above and from [12] to the asymmetric version if the 
initial network is a tree. On general networks we show even stronger negative results. 

Observe, that the instance used in the proof of Theorem [6] and the corresponding instance 
in |12| show that best response cycles in the Swap Game are not necessarily best response 
cycles in the Asymmetric Swap Game. We will show the rather counter-intuitive result that 
this holds true for the other direction as well. Hence, in general we cannot simply transfer 
the cyclic instances from one model to the other even though they are closely related. 



3.1 Asymmetric Swap Games on Trees 



The results in this section follow from the respective theorems in 12 and from the results in 
Section [2.11 and are therefore stated as corollaries. 



Corollary 2. The S\JM-ASG and the M.AX-ASG on n-vertex trees are both a poly-FIPG and 
both must converge to a stable tree in 0(n 3 ) steps. 



Proof. It was shown in 12 that the Sum-SG on trees is an ordinal potential game, where 
the social cost, which is the sum of all agent's costs, serves as ordinal potential function. 
Furthermore, it was shown that the Sum-SG on n-vertex trees must converge in 0(n 3 ) steps. 
Note, that ordinal potential games are a subclass of FIPG [l~6] . 

The only difference in the asymmetric version of this game is that edges have owners and 
only the respective owner is allowed to swap an edge. Clearly, since any improving swap 
decreases the value of the (generalized) potential function, this is independent of the edge- 
ownership. Furthermore, with edges having owners, we have that in each network less moves 
are possible and every moving agent has, compared with the Swap Game, at most the same 
number of admissible strategies in any step. Thus, the convergence process cannot be slower. 
The results from |12| and Theorem [I] then yields the desired statement. □ 

Corollary 3. Using the max cost policy and assuming a n-vertex tree as initial network, we 
have that 

• the Sum-ASG converges in max{0, n— 3} steps, ifn is even and in max{0, n+ \n/2\ —5} 
steps, if n is odd. Moreover, both bounds are tight and asymptotically optimal. 

• the M.AX-ASG converges in O(nlogn) steps. 



Proof. We can carry over the results from [12] and Section 2.1 about speeding up the con- 
vergence process by a suitable move policy. The reason for this is that in all used lower 
bound constructions it holds that whenever an edge is swapped more than once, then it is 
the same incident agent who moves again. Hence, we can assign the edge-ownership to this 
agent and get the same lower bounds in the asymmetric version. The upper bounds carry 
over trivially, since agents cannot have more admissible new strategies in any step in the 
asymmetric version compared to the version without edge-owners. □ 
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3.2 Sum Asymmetric Swap Games on General Graphs 

If we move from trees to general initial networks, we get a very strong negative result for the 
Sum-ASG: There is no hope to enforce convergence if agents stick to playing best responses 
even if multi-swaps are allowed. 

Theorem 7. The Sum-ASG on general networks is not weakly acyclic under best response. 
Moreover, this result holds true even if agents can swap multiple edges in one step. 

Proof. We give a network which induces a best response cycle. Additionally, we show that 
in each step of this cycle exactly one agent can decrease her cost by swapping an edge and 
that the best possible swap for this agent is unique in every step. Furthermore, we show 
that the moving agent cannot outperform the best possible single-swap by a multi-swap. 
This implies that if agents stick to best response moves then no best response dynamic can 
enforce convergence to a stable network and allowing multi-swaps does not alter this result. 




Figure 3: The steps of a best response cycle for the Sum-ASG on general networks. Note, 
that edge directions indicate edge-ownership. All edges are two-way. 

The best response cycle consists of the networks Gi,G2,Gs and G4 given in Fig. [3j We 
begin with showing that in G±, . . . , G4 all agents, except agent b and agent /, cannot perform 
an improving strategy change even if they are allowed to swap multiple edges in one step. 

In Gi, . . . , Ga all leaf-agents do not own any edges and the agents c and e cannot swap 
an edge since otherwise the network becomes disconnected. For the same reason, agent d 
cannot move the edge towards d\. Agent d owns three other edges, but they are optimally 
placed since they are connected to the vertices having the most leaf- neighbors. It follows, 
that agent d cannot decrease her cost by swapping one edge or by performing a multi-swap. 
Note, that this holds true for all networks G\, . . . , G4, although the networks change slightly. 
Agent a cannot move her edges towards a^, for 1 < i < 4. On the other hand, it is easy to 
see that agent a's edge towards vertex e cannot be swapped to obtain a strict cost decrease 
since the most promising choice, which is vertex c, yields the same cost in G\ and G4 and 
even higher cost in G2 and G3. Trivially, no multi-swap is possible for agent a. 

Now, we consider agent b and agent /. First of all, observe that in G%, . . . , G4 agent / 
owns exactly one edge which is not a bridge. Thus, agent / cannot perform a multi-swap 
in any step of the best response cycle. Agent 6, although owning three edges, is in a similar 
situation: Her edges to vertex c and e can be considered as fixed, since swapping one or 
both of them does not yield a cost decrease in G%, . . . , G4. Hence, agent b and agent / each 
have one "free" edge to operate with. In G\ agent 6's edge towards / is placed optimally, 
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since swapping towards a or d does not yield a cost decrease. In G3, agents 6's edge towards 
a is optimal, since swapping towards d or / does not decrease agent 6's cost. Analogously, 
agent /'s edge towards e in G2 and her edge towards d in G4 are optimally placed. 

Last, but not least, we describe the best response cycle: In G\ agent / can improve and 
her unique best possible edge-swap in G\ is the swap from d to e, yielding a cost decrease 
of 4. In Gi agent 6 has the swap from / to a as unique best improvement which yields a cost 
decrease of 1. In G3 have agent / being unhappy with her strategy and the unique best swap 
is the one from e to d yielding an improvement of 1. In G4 it is agent 6's turn again and her 
unique best swap is from a to / which decreases her cost by 3. After agent 6's swap in G4 
we arrive again at network G\ , hence G\ , . . . , G^ is a best response cycle where in each step 
exactly one agent has a single-swap as unique best possible improvement. □ 

Remark 1. Note, that the best response cycle presented in the proof of Theorem^ is not 
a best response cycle in the Sum-SG. The swap fb to fe of agent f in G\ yields a strictly 
larger cost decrease than her swap fd to fe. 

Corollary 4. The Sum-^^G on a non-complete host graph is not weakly acyclic. 

Proof. We use the best response cycle G\, . . . , G4 shown in Fig. [3] and let the host graph H 
be the complete graph but without the edge {a,/}. In this case, agent /'s best response 
move in G\ is her only possible improving move. For the networks G2, G3 and G4 it is easy 
to check, that the respective moving player has exactly one possible improving move. □ 

3.3 Max Asymmetric Swap Games on General Graphs 

Compared to Theorem [7J we show a slightly weaker negative result for the max- version of 
Asymmetric Swap Games. 

Theorem 8. The M.AX-ASG on general networks admits best response cycles. Moreover, no 
move policy can enforce convergence. 

Proof. We show that there exists a best response cycle for the Max-ASG, where no move 
policy can enforce convergence. Our cycle, shown in Fig. |4j has six steps G\, . . . ,Gq. In G3 
and Gq there are two unhappy agents whereas in the other steps there is exactly one unhappy 
agent. It turns out, that independently which one of the two agent moves in G3 or Gq, there 
is a best response move which leads back to a network in the cycle. This implies, that no 
move policy may enforce convergence. 

First of all, note that the networks G2 and G5 are isomorphic. The same holds true for 
the networks G3 and Gq. We start by showing that in the networks G±, . . . , G4 only the 
highlighted agents are unhappy. Then we will analyze the best response moves of the unhappy 
agents in the respective networks. 

Consider any Gi, where 1 < % < 4. Clearly, no leaf-agent of Gi can perform any swap. 
Agent g has cost 3 in Gi. We have dc^g, h) = dG t (g, h) and there is no vertex in Gi which is 
a neighbor to both l\ and Z5. Thus, agent g cannot achieve cost 2, which implies that agent 
g does not want to perform a move in Gi. By the same argument, it follows that any agent 
having cost 3 must be happy. Hence, we have that agent 6 in G\ and G2 and agent c in G3 
and G4 do not swap. 
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Agent d, having cost 4 in Gi, cannot perform an improving move in Gi, since do i (d, li) = 
d>Gi(d,l4) = 4 and any such improving move must connect to a vertex which has distance 
at most 2 to vertex l\. These vertices are c,j,k and l\ but any of them has distance at 
least 3 towards I4 in Gi. Agents / and i, both having cost 4 in Gi, each face an analogous 
situation, since dGi(f,h) = ^G;(Mi) = ^GiC/j^s) = dc^i,^ = 4 and since vertices c,j,k,l\ 
have distance at least 3 to I5 or 1% in any Gi. 

Agent h, having cost 4 in Gi has both l\ and I2 in distance 4. The only vertex which has 
distance at most 2 to both of them is vertex j. But if h swaps towards vertex j, then this 
yields distance 4 towards towards vertex I3. 

Now we consider agent e. Any strategy yielding cost at most 3 for agent e must connect 
to c, j, k or h, since otherwise vertex l\ would be in distance 4. But, since dc^e, 1 4) = 4 and 
since c,j,k,l\ all have distance at least 3 towards I4, no such strategy can exist. For agent 
j, having cost 4 in Gi. the situation is similar. Any strategy having cost at most 3 for agent 
j must connect to a, 6, h, i or I5, since otherwise j's distance towards Z5 would be 4. But if j 
swaps away from g to any vertex in {a, b, h, i, 1$}, then her distance to Z3 increases to 4. 

Agent k, having cost 4 in Gi, has vertex I4 and Z5 in distance 4. Thus, to achieve as cost 
of at most 3, agent k must connect to vertex h or i. Both h and i have distance at least 3 
towards I3. Since dG t (k, I3) = 4, it follows, that agent h cannot perform an improving move. 

Now, we are left with agent c in Gi, agent a in G2 and agent b in G4, all having cost 
4. We have dc 1 (c, I2) = 4. Thus, any strategy which yields cost at most 3 must connect 
to d,e,j or 1%. If c swaps away from g, then her distance to I4 increases to 4. If c swaps 
away from b, then her distance to l§ increases to 4. Hence, agent c cannot swap an edge to 
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decrease her cost in G±. For agent a in G2 and agent b in G4 the situation is similar. We 
have d(3 2 (a,/i) = dc 2 (o;h) = ^G 4 (Mi) = dc A {b,h) = 4. Both agents cannot decrease their 
cost by swapping an edge in the respective network, since all vertices which have distance at 
most 2 to l\ have distance at least 3 to I3. 

Finally, we analyze the best response moves of all unhappy agents in any Gi. In G\ only 
agent a, having cost 5, is unhappy. There is only one vertex having distance 5 to a, which 
is l\. Thus, agent could swap towards any vertex having distance at most 3 to l\ to improve 
on this distance. Possible target-vertices are b,c,e, g, j,k and l\. But, after any such swap, 
agent a must have distance 4 to vertex Z3, which implies, that agent a cannot decrease her 
cost by more than 1. Furthermore, swapping towards c, k or l\ yields distance 5 towards I3, 
which rules out these target vertices. If agent a performs the swap ad to ae in G\, then we 
obtain network G<i- In G2 only vertex c, having cost 4, is unhappy. Her unique vertex in 
distance 4 is h- Thus, a swap towards a,d,e,j or I2 would reduce this distance. However, 
only the swap towards a is an improving move, since in all other cases agent c's distance to 
^6 increases to 4. This swap transforms G2 into G3. In G3 we have that agent a and agent 
b are unhappy. Agent b in G3 is in a similar situation as agent a in G\. Her only vertex 
in distance 5 is the leaf l\ and by swapping towards a,c,e,g,j,k or l\ this distance can be 
reduced. But any such swap yields that agent b's distance to Z3 increases to at least 4, which 
implies that a cost decrease by 1 is optimal. Furthermore, the swaps towards c, k or l\ are 
not improving moves since these yield distance 5 towards I3. If agent b performs the swap bd 
to be we obtain network G4. Agent a in G3 has Z3 as her only vertex in distance 4. There 
is exactly one swap for agent a, which decreases this distance to 3 without increasing any 
other distance to more than 3, and this is the swap ae to ad. Note, that this swap of agent a 
transforms G3 to a network which is isomorphic to network G\. Finally, we argue for agent 
o in G4. This agent is in a similar situation as agent a in G3. Her only vertex in distance 4 
is I3 and the swap towards d is the only swap which does not increase any other distance to 
more than 3. Thus, this move is agent a's unique best response in G4 and this move leads to 
network G5. □ 

Corollary 5. The M.AX-ASG on a non-complete host graph is not weakly acyclic. 

Proof. We use the best response cycle G\, . . . , Gq from Fig. [4} As host graph H we use the 
complete graph, but without edges {a, b}, {a, g}, {a,j}, {b,g}, {b, j}. By inspecting the proof 
of Theorem [8j it is easy to see that in each step of the cycle the moving agent has exactly 
one improving move. □ 

3.4 The Boundary between Convergence and Non-Convergence 

In this section we explore the boundary between guaranteed convergence and cyclic behavior. 
Quite surprisingly, we can draw a sharp boundary by showing that the undesired cyclic 
behavior can already occur in n-vertex networks having exactly n edges. Thus, one non- 
tree edge suffices to radically change the dynamic behavior of Asymmetric Swap Games. 
Our constructions are such that each agent owns exactly one edge, which corresponds to 
the uniform unit budget case, recently introduced by Ehsani et al. [9j. Hence, even if the 
networks are build by identical agents having a budget the cyclic behavior may arise. This 
answers the open problem in 19] the negative. 
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Theorem 9. The Svm-ASG admits a best response cycle on a network where every agent 
owns exactly one edge. 



Proof. The network which induces a best response cycle and the steps of the cycle are shown 
in Fig. [5j Let denote the number of vertices having the form kj, for some index j. 




Figure 5: The steps of a best response cycle for the Sum-ASG where each agent owns exactly 
one edge. 

In the first step, depicted in Fig. [5] (1), agent a\ has only one improving move, which is the 
swap from b\ to c\. This swap reduces agent a\s cost by 1, since n c = nb + + 1. After this 
move, shown in Fig. [5] (2), agent b\ is no longer happy with her edge towards d\, since by 
swapping towards 04 she can decrease her cost by 2. This is a best possible move for agent b\ 
(note, that a swap towards 03 yields the same cost decrease). But now, in the network shown 
in Fig. [5] (3), by swapping back towards vertex bi, agent a\ can additionally decrease her 
distances to vertices (14 and (25 by 1. This yields that agent a\s swap from c\ to 61 decreases 
her cost by 1. This is true, since all distances to Cj vertices increase by 1 but all distances 
to bi and di vertices and to 04 and 05 decrease by 1 and since we have n c = n& + 714 + 1. 
Note, that this swap is agent a±s unique improving move. By construction, we have that 
after agent a\ has swapped back towards b\, depicted in Fig. [5] (4), agent b\ 's edge towards 04 
only yields a distance decrease of 7. Hence, by swapping back towards d\, agent b\ decreases 
her cost by 1, since her sum of distances to the dj vertices decreases by 8. This swap is the 
unique improving move of agent b\ in this stage. Now the best response cycle starts over 
again, with agent a\ moving from b\ to c\. □ 

Theorem 10. The yiAX-ASG admits a best response cycle on a network where every agent 
owns exactly one edge. 

Proof. Fig. [6] shows the steps of a best response cycle for the Max version in a network, 
where each agent owns exactly one edge. 

In the first step of the cycle, shown in Fig. [6] (1), agent a\ can decrease her maximum 
distance from 6 to 5 by swapping from e\ to one of the vertices e2, • • • ,e$. Note, that all 
these swaps yield the same distance decrease of 1 and, since a\ has distance 5 towards a% and 
swapping towards any of the aj-vertices is obviously sub-optimal, no other swap can yield a 
larger cost decrease. By agent a\ performing the swap towards es we obtain the network in 
Fig. [6] (2). 

Now, agent b\ can improve her situation with a swap from a\ to a<i or to 03. Both possible 
swaps reduce her maximum distance from 6 to 5. This is best possible: The cycle has length 
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(1) 



(2) 



(3) 



(4) 



Figure 6: The steps of a best response cycle for the Max-ASG where each agent owns exactly 
one edge. 

9 before agent &i's move, which implies that there are two vertices on the cycle which have 
distance 4 to b±. Observe, that agent b\ must swap towards a vertex which has at most 
distance 4 to vertex a& to reduce her maximum distance. Clearly, only one of the Oj vertices, 
with i ^ 1, is possible. However, swapping away from a% must increase the cycle by at least 
1, which implies that after this swap agent b\ must have at least one cycle- vertex in distance 
5. Hence, no swap can decrease agent b\s maximum distance by more than 1. Let agent b± 
perform the swap towards 03 and we end up with the network in Fig. [6] (3). 

Agent ai now finds herself sitting in a large cycle having maximum distance 7 towards 
ds and distance 6 to vertex 64. By swapping from es to one of the vertices ei,e2,e3, agent 
a\ can reduce her maximum distance to 6. This is optimal, since all improving moves must 
swap towards a vertex having at most distance 5 to vertex d% and agent a\ cannot move 
to far away from vertex cq. The vertices ei,e2,e3 are the only vertices which satisfy both 
conditions. Let a\ swap towards e\ and we get the network depicted in Fig. [6] (4). 

In the last step of the best response cycle, we have agent b\ with maximum distance 8 
towards vertex e§. Clearly, agent 61 wants to move closer to this vertex but this implies, that 
she must move away from vertex a§. The only possible compromise between both distances 
is a swap either to a\ or to e\. Both these swaps yield a decrease of &i's maximum distance 
by 1. By swapping towards vertex a±, we end up with our starting configuration and the 
cycle is complete. □ 

Remark 2. We can give best response cycles for both versions of the ASG for the case where 
every agent owns exactly two edges. We conjecture, that such cyclic instances also exist for 
all cases where every agent owns exactly k edges, for any k > 3. In particular, it would be 
interesting if there is a generic construction which works for all k > 1. 
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4 Dynamics in (Greedy) Buy Games 



We focus on the dynamic behavior of the Buy Game and the Greedy Buy Game. Remember, 
that we assume, that each edge can be created for the cost of a > 0. We show that best 
response cycles exist, even if arbitrary strategy-changes are allowed. However, on the positive 
side, we were not able to construct best response cycles where only one agent is unhappy 
in any step. Hence, the right move policy may have a substantial impact in (Greedy) Buy 
Games. 

All the above constructions of best response cycles cannot be used for our purpose, since 
a has to be chosen high enough such that no agent wants to buy any additional edge, but at 
the same time low enough, such that all cycles in the network remain in place. 

Theorem 11. The S\JM-GBG and the Sxjm-BG admit best response cycles. 

Proof. We prove both statements by giving a best response cycle, where the best response of 
any moving agent consists of either buying, deleting or swapping one edge. The best response 
cycle Gi, . . . , Gq, for 7 < a < 8, is depicted in Fig. [7j 




(1) (2) (3) 




(4) (5) (6) 

Figure 7: The steps of a best response cycle for the Stjm-(G)BG for 7 < a < 8. 

We analyze the steps of the cycle and show, that the indicated strategy-change is indeed a 
best response move - even if there are no restrictions on the admissible strategies. 

In network G±, it is obvious, that agent g is unhappy with her situation. The indicated 
swap gf to gc decreases agent g's cost from 21 to 15. This is a best possible move, which 
can be seen as follows. Clearly, deleting her unique own edge would disconnect the network. 
Hence in all optimal strategies agent g must purchase at least one edge. Among all strategies, 
where g buys exactly one edge, that is, among all possible single edge-swaps, we have that 
buying an edge towards a vertex having minimum cost in G\ — g is optimal. Here, G\ — g is 
the network G\ with vertex g removed. Thus, swapping her edge towards vertex c is optimal. 
This swap yields a cost of 15 for agent g. It is easy to see, that no other arbitrary strategy 
can be better, since if agent g buys at least two edges, then her cost is greater than 16, since 
a > 7. After g has performed her strategy-change we obtain network Gi- 

In (?2 we claim that agent / is unhappy and that her best possible move is to buy an 
edge towards vertex b. First of all, this is an improving move, since the edge fb decreases 
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her cost from 19 to 11 + a, which is a strict cost decrease since a < 8. The target vertex 
b is optimal, since connecting to c yields the same cost and connecting to any other vertex 
yields a higher cost. Clearly, agent / cannot delete or swap any edges. Furthermore, buying 
at least two edges yields cost of more than 19, since 2a > 14 and there are six other vertices 
in G2 to which / has distance at least 1. The edge purchase of agent / leads us to network 
G 3 . 

In network G3 we claim that agent c is unhappy and that her best possible move is to 
delete her edge towards b. Agent c has cost 9+a in G3. Deleting edge cb yields cost 16 < 9+a, 
since a > 7. Clearly, no strategy which buys at least two edges can be optimal for agent c, 
since 6 + 2a > 16. On the other hand, swapping her unique edge away from b must increase 
agent c's cost since at least one distance increases to 3. If agent c deletes her edge cb, then 
we obtain network G4. 

In G4, we have that agent g is in a similar situation as she was in G\. Agent g is again 
a leaf- vertex of a path of length 6. Thus, by an analogous argument as for g in G%, we have 
that the swap gc to gf is a best possible move for agent g in G4. This move leads us to 
network G5. 

In network G5 we have that agent c is in a similar situation as agent / in G2 ■ By analogous 
arguments, it follows that buying the edge towards b is a best possible move of agent c in G5. 
This edge-purchase transforms G5 into Gq. 

Finally, in network Gq we have that agent / is in a similar situation as agent c in G 3 . 
Thus, by analogous arguments, we have that deleting her edge fb is an optimal move for 
agent / in Gq. This deleting transforms Gq into Gi and we have completed the cycle. □ 

Theorem 12. The Max-GBG and the Max-BG admit best response cycles. 

Proof. We give a best response cycle, where in every step of the cycle the moving agent has 
a best response which consists of a greedy move, that is, the strategy-change is the addition, 
deletion or swap of one edge. The best response cycle G±, . . . , G4, for 1 < a < 2, can be seen 
in Fig. |8| 




(1) (2) (3) (4) 

Figure 8: The steps of a best response cycle for the Max-(G)BG for 1 < a < 2. 

We show for each step of the cycle, that the indicated moving agent indeed performs a 
best response move which transforms the network of one step into the network of the next 
step of the cycle. 
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In network G\ we claim that agent g is unhappy and that a best response move is to buy 
the edge ga. Clearly, agent g cannot delete or swap any edges. Hence, it suffices to analyze 
all buy or multi-buy operations. Agent g has cost 5 in G\. The purchase of edge ga is an 
improving move, since with this yields a distance cost of 3 for agent g and since a < 2. Note, 
that agent g must buy at least one edge to reduce her distance cost in G\. Furthermore, it is 
easy to see that with one additional edge a distance cost of 3 is best possible. Now, observe, 
that if agent g buys more than one edge, than her distance cost may decrease, but it cannot 
decrease by more than 1 per edge. Since a > 1, no strategy which buys at least two edges 
can yield strictly less cost than 3 + a. The indicated move of agent g transforms G± into G2. 

In G2, agent e, having cost 4, is unhappy with her situation. Buy buying the edge ea, 
agent e can decrease her distance cost to 2. Since a < 2, it follows that this is a improving 
move. Note, that agent e cannot delete or swap any edges and that distance cost of 2 is 
optimal, unless agent e buys 5 edges, which clearly is too expensive. Hence, buying the edge 
ea is a best response move for agent e and this move leads to network G3. 

In network G3, we have that agent g, having cost 3 + a, is unhappy. By deleting her own 
edge ga, agent g can achieve a cost of 4, which is strictly less than 3 + a, since a > 1. We 
claim that deleting edge ga is a best response move for agent g. If agent g swaps her unique 
own edge, than she cannot achieve a distance cost of less than 3. Thus, no swap can be an 
improving move. If agent g buys at least one additional edge, then such a purchase may 
decrease agent g's distance cost by 1 per edge, but since a > 1, this cannot outperform her 
current strategy in G3. Thus, deleting edge ga is the only improving move of agent g and, 
thus, must be her best response move. This move transforms network G3 in to G4. 

Finally, in network G4 we have that agent e, having cost 3 + a, is unhappy. Deleting 
her edge ea yields a cost of 4, which is strictly less than 3 + a. Swapping this edge cannot 
decrease her distance cost below 3, which rules out any edge-swaps. If agent e buys at least 
one additional edge, then she can reduce her distance cost by at most 1 per additional edge. 
Clearly, no such strategy may yield less cost than 3 + a and, thus, cannot be an improving 
move. Hence, deleting her edge ea is her unique improving move, which must be her best 
response move. This move transforms G4 into G\. □ 
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